CDS
Accession Number | TCMCG075C13095 |
gbkey | CDS |
Protein Id | XP_017975023.1 |
Location | complement(join(21755182..21756204,21756641..21756766,21756853..21757017,21757106..21757240,21757336..21757646,21758583..21758721,21758900..21758996,21759099..21759196,21759695..21759760,21760784..21761245,21761367..21761570,21761736..21761864,21761950..21762036,21762177..21762233,21762633..21762701,21762806..21762950,21763532..21763755,21766829..21766922,21769219..21769286,21769388..21769444,21769534..21769633,21769781..21769830,21769956..21770105,21770743..21770899,21776831..21776919,21777165..21777269,21777356..21777424,21777498..21777620,21777768..21777836,21778484..21778577,21778800..21778886,21779028..21779195,21779342..21779452,21780718..21780824,21780949..21781081,21781241..21781344,21783976..21784101,21784437..21784505,21785975..21786055,21786217..21786306,21797294..21797398,21797573..21797637,21797935..21798022,21798099..21798188,21798306..21798542,21798953..21799059,21802645..21802852,21803543..21803710)) |
Gene | LOC18602072 |
GeneID | 18602072 |
Organism | Theobroma cacao |
Protein
Length | 2301 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018119534.1 |
Definition | PREDICTED: HEAT repeat-containing protein 5B isoform X2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGCGAGGAGGAATTACGTGAGAGAGAACGTTCCTCTCTCACGCTTCGGCGTCTTGGTTGCTCAGCTTGAGTCCATCGTCGCTTCCGCATCACAGAAATCTCCCGACCCTCTCCTTTGCTTCGATCTCCTCTCCGATCTCCTTTCCGCCCTCGACGACGAGCCCAAGGAATCTATCTTGTTGTGGCAAAGAAAATGTGAGGATGCTTTGTATTCCTTGCTTATTCTTGGTGCTAAACGACCTGTACGTCATTTGGCATCCGTAGCAATGGGAAGGATAATATCTAAAGGAGATAGCATTTCAATATACTCAAGAGCAAGTAGTCTCCAAGGGTTTCTTTCGGATGGGAAGCGGAGTGAGCCACAGAGAATTGCTGGTGCTGCACAATGCTTGGGCGAGTTATATCGACATTTTGGAAGAAGAATAACTTCGGGTTTGCTTGAAACAACTATTATTGCAACAAAACTCATGAAGTTTCATGAGGAGTTTGTAAGACAAGAGGCTTTGCTCATGCTTCAGAATGCTTTGGTAGGCTCTGGTGGTAGTGCTGCTGCTTCAGCATACACTGAGGCATTCCGTCTCATTACAAGATTTGCCATTGGAGACAAAGCATTTGTTGTTAGAATAGCTGCAGCACGTTGTCTGAAGGCTTTTGCCAACATAGGAGGACCAGGTCTAGGGGTTGGAGAACTTGACAGTTTAGCCTCAAATTGTGTCAAGGCTCTTGAGGATCCCATAACTTCTGTTCGGGATGCGTTTGCTGAAGCTCTGGGCTCATTAATTGCTCTTGGAATGAATCCTGAGGCACAGGTTCAACCAAGGGGAAAAGGCCCTTTTCCTCCAGCGAAGAAACTAGAAGGTGGTTTGCAGAGGCATCTGGCTTTGCCCTTCACAAAAGCAAGCAGTATTCGGTCAAAGGACATTCGAGTAGGCCTAACATTATCTTGGGTGTTCTTTTTACAGGCCATTCGTCTGAAGTATCTACATCCAGATATTGAACTTCAAAATTATGCTTTGAACGTTATGGACATGCTTCGTATGGATATGTCTGTTGATGCCCATGCACTGGCATGTGTTCTCTATATCCTTCGTGTTGGTGTAACTGATCAGATGACAGAACCTACTCAAAGGAGCTTTACAGTTTTTCTTGGAAAGCAGCTTCAATCTCCAGAGGCTAGCCCTTCCATGAAAATTGCTGCTCTACGTACACTATCATATACTTTGAAAACTTTAGGAGAGGTTCCACATGAATTCAAGGAAGTTTTGGATAACACAGTTGTTGCAGCAGTATCTCACTCTGCCCAGCTTGTACGTGTTGAGGCTGCTTTAACATTGCGTGCATTGGCTGAGGTTGATCCCACTTGTGTTGGTGGTTTGATATCTTATGGGGTGACCACACTGAATGCTTTAAGGGAAAGTGTTTCCTTTGAAAAGGGAAGCAATTTGAAAGTTGAGCTTGATTCTTTGCATGGGCAAGCAACAGTTTTGGCTGCTTTAGTTTCCATTTCACCAAAATTACCTCTTGGTTATCCAGCTAGATTGCCCAAGTCAGTTCTTGAAGTTTCAAGGAAAATGCTGACTGAATTTAGTAGAAATGCTGCTACTGCCATGGTGGAAGAAGAAGCTGGATGGTTACTCTTATCATCATTACTATCTGCTATGCCAAAGGAGGAGCTTGAGGATCAGGTCTTTGATATTCTTTCCTTGTGGGCTGACCTTTTCAGTGGAAACCCAGAAGATGTTATTAGACAAAGTGGAGATTTACAATCTAGGATTCGTGTGTGGTCTGCAGCAATTGATGCACTTACATCATTTGTACGATGCTTTGTTTCATCCAATTCGACAATTAGTGGGATTTTACTTCAACCAGTGATTCTATATCTCAATAGGGCTTTGTCCTATATCTCTCTGTTGGCAGCCAAAGAACAACCAAATATTAAGCCTGCAATGGACGTATTCATTATCAGAACGCTAATGGCCTATCAGTCCCTTCCT
GATCCTATGGCCTATAGGAGTGACCATTCTCGGATTATTCAACTATGCACAGTTCCCTATAGAAATGCTTCTGGATGTGAGGAAAGTTCATGCTTAAGGTTCCTGTTAGACAGAAGAGATGCATGGTTGGGCCCTTGGATTCCTGGCAGGGATTGGTTTGAAGATGAACTTCGTGCTTTTCAAGGTGGAAAAGATGGGCTCATGCCTTGTGTATGGGATAATGAAATTTCAAGTTTTCCTCAGCCGGAGACTATAAATAAGATGTTGGTAAATCAAATGCTTCTGTGCTTTGGAATCATATTTGCTGCTCAGAATAGTGGTGGTATGCTGTCACTTCTTGGAATGATGGAGCAGTGTCTAAAAGCTGGGAAAAAGCAACCATGGCATGCTGCAAGTGTAACCAATATATGTGTGGGGTTACTTGCTGGGTTGAAGGCTTTGCTCGCGTTACGTCCGCAATCATTAGAGTTAGAGATATTAAATTTGGCTCAAGCTATTTTTAAGGGTATACTAATTGAGGGAGACATTTGTGCATCACAACGTAGGGCATCATCAGAGGGTCTTGGTCTTTTAGCTCGCCTTGGAAGTGATATCTTCACAGCCAGGATGACTCGATTGTTGCTTGGAGAGCTAAATGGTATAACAGATTCAAATTATGCTGGCTCAATTGCTCTTTCCCTAGGATGTATTCATCGCAGTGCTGGAGGGATGGCACTGTCAACTTTAGTGCCTACTACCGTGAGCTCAATTTCTTTGCTGGCTAAAAGTGCAATCCCTGGCTTACAGATCTGGTCTTTGCATGGACTTCTTTTGACTATTGAAGCTGCTGGCTTGTCCTTTGTATCTCATGTCCAGGCAACACTTGGCCTTGCTTTGGAGATTTTGCTGTCCGAGGAGATAGGAAGGGTTGACCTTCAGCAAGGTGTGGGACGCCTTATAAATGCAATAGTTGCCGTTCTTGGTCCTGAGCTTGCCTCTGGCAGCATTTTCTTTTCACGCTGCAAGTCTGTTATTGCGGAGATTAGTTCCTCACAAGAAACAGCTACAGTACTTGAGAGTGTCCGTTTTACACAACAGCTTGTTCTTTTTGCACCACATGCTGCTTCGGTGCACTCCCATGTCCAAACTCTTCTGCTGACTCTGTCATCAAGACAGCCGATGTTAAGGCATCTTGCAGTCTCCACTGTACGACATCTCATTGAGAAGGACCCTGTTTCCATTATTGATGAACAAATAGAAGATAATCTGTTTCGAATGCTAGATGAAGAAACTGATTCAGAGATAGGGAATTTAATCCGTGGTACTATCATACGACTACTTTACGTCTCTTGCCCTTCACATCCTTCTCGTTGGATATCAATTTGTCGTAACATGGTTCTTTCTATGTCAACGAGAGCAACTGCTGAGATTAGTAAAGGTTCAGGAAATGATTCAGTTAGTGGTCCAGATGGTGACTCAAGGTTAAATTTTGGAGACGATGACGAAAACATGGTCTATAGTTCTAAAAATATGTTTCAAGGTCATGCATTTGAAGCTTCTAATGTTGGTTGTAATAGAGATAAGCACCTCAGATACCGAACCAGAGTTTTTGCTGCTGAGTGCTTGAGTTATCTACCAGAAGCTGTTGGAAAGAATCCTGCTCATTTTGATCTTTCTTTAGCAATGAGAAAAGTTGCAAATGGACAGGCCTATGGTGATTGGCTAATCCTCCAAGTTCAAGAGCTAATATCAGTTGCTTATCAGATAAGCACAATTCAGTTCGAAAACATGCGTCCAATTGGTGTTGGACTTCTAAGTTCAGTTGTAGACAAGTTTGAAACGGTTGTTGACCCTGAACTTCCAGGACATGTTCTACTAGAACAGTATCAAGCACAACTAATATCTGCTGTTCGCACTGCACTGGACACATCATCCGGCCCTATTCTTTTGGAGGCAGGTCTGCAGCTGGCTACTAAGATAATGACAAGTGGAATAATTAGTGGTGATCAAGTTGCAGTAAA
ACGCATATTTTCACTAATATCACACCCGCTGGATGACTTCAAGGACCTATATTATCCTTCATTTGCAGAATGGGTCTCATGTAAGATCAAGGTAAGACTTCTAGCTGCTCATGCCTCTCTCAAGTGTTATACTTATGCATTCTTGAGGAGACACCAAGCTGGGGTTCCTGATGAGTATCTAGCATTACTACCATTGTTTTCAAGAAGTTCAAGCATTCTGGGGAAGTACTGGATCTGGCTTTTGAAGGACTATTGTTATATATGCTTGCGCTTAAATCTCAAAAGAAATTGGAATTCATTCCTTGATGCAATTCAAGCACGTCTGGTCTCATCAAAGTTGAAGCCTTGTTTAGAAGAAGCTTGGCCAGTTATTTTGCAAGCACTTGCTCTTGATGCAGTTCCTGTGAATGTTGATAGGATTGGAAACTCAGAAGCTGCTGTTGAAAATATATCAGTAAACAGCTTAGTATCTGGCTACAGTATGGTTGAATTGGAATCTGAAGAATACCAATTTCTGTGGAGCTTTGCATTGCTTGTTCTATTTCAGGGACAACATCCAGCATTTTGTAAACAAATTATACCATTAGCGTCCAGTAAAGCAAAACATGAAGAAGATTCTCCCTCTGAGGACATGAATTCTCCAGGCTTGAAGTTTTATGAAATTGTTTTGCCAGTTTTCCAGTTTCTTCTAACTCAGAAGTTTTTCTCAGCTGGTTTTCTTACTGTGAATATCTGTGAAGAACTGCTCCAGGTTTTCTCTTATTCTATCTACATGGATAATTCATGGAACAGTCTTGCAATATCTGTTTTATCCCAGATTGTGCACAACTGCCCTGAAGATTTCCTTGGAGCAGAAAATTTCACTTGTCTGGTGGTGGAACTTTGTGTGGGTTGCCTTTTTAGAGTTTATAACTGTGCTAGTGCAATCTCACTTGATCAAGCGGACTGGGAGGATTTAATTTCACCACTATTTATTGCTACGAAGACCATCATGAGGCGTTCTGAACCAAAAAAGCAATTGAATTCAGTAGCGCTGGCATTTCTGTTGATTGGTTACAAATTCATTAGGCAAGCTTCAACTGAGCTATCCCTCTCAAAAGTTACTGATTTTGTGAAGAGTGTGAATTCCTTTTTGAAGAAACTTATTGATGATGCTTCTAAGCTTGGTGATGATGCCATTGTCAACCAGAGAACAATTTTGTGCACTTCTCTAAATGAGATTGCTGGTTTGACTAAGGATTGCATTGAAGGGATTTGTCTACTGCATAATAAGAGATCTGACTTGCGCAAACTACTGCTATTGAAGCTTGCATTCTCTATGGAACAGATAATTATATTGCCCAAGATAATGCTGGAAATTCAATGTCTAGAAGGCAACAAAGATAGTGATCCTATCTATTTCTCGGTGTTTAAATTTTGCACTAATTGTATGCAAACTATACTAAATGACTCAAATGTACAGGTTCAGGCAATCGGGTTGCAGGTGCTGAAAAGCATGGTGCAGAAAAGCAGCACTGTAGAAGACAATAGTTCCATAATATTCATTATTGGAGAGCTGGTTGGGGACATTCTCACCATAATAAAAAACACATTAAAGAAACCCATGACTAAAGAATCAGTTGCTATTGCTGGGGAATGCTTACAAGTTCTAATGCTCCTGCAAACACTGTCAAAAGGGAGTGAATGCCAGAGACGGTTCATGAGTCTCCTTCTGGAAGCCATTCTTATGATCTTCTCAGCATTAGAGGATGATTGTTCTCAGGAAGTCAATGACATAAGAAGCACTGCCCTGAGGCTTGTTTCTCATCTTGCTCAAATTCCTTCTTCTGCTGATCATCTCAAGGATGTCTTGTTATCAATGCCTGAGATGCACAGACAGCAACTCCAGGGGGTTATTCGTGCTTCTGTAACACAGGACCACGGTGCAGCACAAATGAAATCTATGTCACCAGCATTAGAGATTAAACTACCAGTGCCAGTGGAAGGAAGGAAAGAGGACA
ATTTCCTATCAGCAGCCACTCAAGTAAAGCTTAAACAGCAAAGTGAAGAAAGTGATTTACCTCCATCAGCCAACCCTATAAACACCAATAATGATGACATGGAAGAAGATGAAGAGGACGAAGATGATTGGGACACCTTCCAGTCTTTTCCGGCCTCAAAGAATACGGCTGAAAGTGATTCTGTAGTTGAAAATGTTGCAAAAGACCCAGGCCCTGATGAGAACTCTTCTGCTTTGGAAATTGGTACTGTTGACTTTGAACAACATCCGTCTGCTGAAAATCTCAGTAATGTAGAGACTACTAATGCAGAGCATTCAGAGTTTCCAGCAGATATAATATCTGATGGCTCAGGTGACAGAGGTAAGATGGAACTACTTGACTCTCTATCAAACCCTGTTATTGATCCTCATGAAAATCAAGATCGAGAAGGAAACAAGGAACTAATATCAAGCACCGATAGTGAGGCCAGAGAAGTTCCAAATAACGGCAATGAGAAAATGTCATCTGATCTTCAAGTTGTTGAAGATGTGAAAGTATCATCTGTAGAGATAGAGGATTATGAGCGGAGAAGGGACAACCCAGTAGCCTCAACTGAACCTCGACATAGTGAAGGTGATGAAGGATCAGTCAATGCAGTCGAGGATCATGAGCACCAAGAGGAGAGTCCTGATAATAAAGTTGATGCGTCACATGCTCAGGCTCCTGAAGGGCTTGCTGGTAACGAAGCCAAAGAGGAAGCTGAAGGTGAAATTTATCAGTTACAGAATAAGGAAGCTGGTGAAGATGTTAGAGAAAGAACGGAGAATAAGAGCAATGTGCAGGAAAGAGAGAGCCAAGATAACTTGGAACCACCAAACAAGGAAGCGGATAAAGCTAATTTAGAATCTGGTGAGGGAATTGATAAGATATGA |
Protein: MARRNYVRENVPLSRFGVLVAQLESIVASASQKSPDPLLCFDLLSDLLSALDDEPKESILLWQRKCEDALYSLLILGAKRPVRHLASVAMGRIISKGDSISIYSRASSLQGFLSDGKRSEPQRIAGAAQCLGELYRHFGRRITSGLLETTIIATKLMKFHEEFVRQEALLMLQNALVGSGGSAAASAYTEAFRLITRFAIGDKAFVVRIAAARCLKAFANIGGPGLGVGELDSLASNCVKALEDPITSVRDAFAEALGSLIALGMNPEAQVQPRGKGPFPPAKKLEGGLQRHLALPFTKASSIRSKDIRVGLTLSWVFFLQAIRLKYLHPDIELQNYALNVMDMLRMDMSVDAHALACVLYILRVGVTDQMTEPTQRSFTVFLGKQLQSPEASPSMKIAALRTLSYTLKTLGEVPHEFKEVLDNTVVAAVSHSAQLVRVEAALTLRALAEVDPTCVGGLISYGVTTLNALRESVSFEKGSNLKVELDSLHGQATVLAALVSISPKLPLGYPARLPKSVLEVSRKMLTEFSRNAATAMVEEEAGWLLLSSLLSAMPKEELEDQVFDILSLWADLFSGNPEDVIRQSGDLQSRIRVWSAAIDALTSFVRCFVSSNSTISGILLQPVILYLNRALSYISLLAAKEQPNIKPAMDVFIIRTLMAYQSLPDPMAYRSDHSRIIQLCTVPYRNASGCEESSCLRFLLDRRDAWLGPWIPGRDWFEDELRAFQGGKDGLMPCVWDNEISSFPQPETINKMLVNQMLLCFGIIFAAQNSGGMLSLLGMMEQCLKAGKKQPWHAASVTNICVGLLAGLKALLALRPQSLELEILNLAQAIFKGILIEGDICASQRRASSEGLGLLARLGSDIFTARMTRLLLGELNGITDSNYAGSIALSLGCIHRSAGGMALSTLVPTTVSSISLLAKSAIPGLQIWSLHGLLLTIEAAGLSFVSHVQATLGLALEILLSEEIGRVDLQQGVGRLINAIVAVLGPELASGSIFFSRCKSVIAEISSSQETATVLESVRFTQQLVLFAPHAASVHSHVQTLLLTLSSRQPMLRHLAVSTVRHLIEKDPVSIIDEQIEDNLFRMLDEETDSEIGNLIRGTIIRLLYVSCPSHPSRWISICRNMVLSMSTRATAEISKGSGNDSVSGPDGDSRLNFGDDDENMVYSSKNMFQGHAFEASNVGCNRDKHLRYRTRVFAAECLSYLPEAVGKNPAHFDLSLAMRKVANGQAYGDWLILQVQELISVAYQISTIQFENMRPIGVGLLSSVVDKFETVVDPELPGHVLLEQYQAQLISAVRTALDTSSGPILLEAGLQLATKIMTSGIISGDQVAVKRIFSLISHPLDDFKDLYYPSFAEWVSCKIKVRLLAAHASLKCYTYAFLRRHQAGVPDEYLALLPLFSRSSSILGKYWIWLLKDYCYICLRLNLKRNWNSFLDAIQARLVSSKLKPCLEEAWPVILQALALDAVPVNVDRIGNSEAAVENISVNSLVSGYSMVELESEEYQFLWSFALLVLFQGQHPAFCKQIIPLASSKAKHEEDSPSEDMNSPGLKFYEIVLPVFQFLLTQKFFSAGFLTVNICEELLQVFSYSIYMDNSWNSLAISVLSQIVHNCPEDFLGAENFTCLVVELCVGCLFRVYNCASAISLDQADWEDLISPLFIATKTIMRRSEPKKQLNSVALAFLLIGYKFIRQASTELSLSKVTDFVKSVNSFLKKLIDDASKLGDDAIVNQRTILCTSLNEIAGLTKDCIEGICLLHNKRSDLRKLLLLKLAFSMEQIIILPKIMLEIQCLEGNKDSDPIYFSVFKFCTNCMQTILNDSNVQVQAIGLQVLKSMVQKSSTVEDNSSIIFIIGELVGDILTIIKNTLKKPMTKESVAIAGECLQVLMLLQTLSKGSECQRRFMSLLLEAILMIFSALEDDCSQEVNDIRSTALRLVSHLAQIPSSADHLKDVLLSMPEMHRQQLQGVIRASVTQDHGAAQMKSMSPALEIKLPVP
VEGRKEDNFLSAATQVKLKQQSEESDLPPSANPINTNNDDMEEDEEDEDDWDTFQSFPASKNTAESDSVVENVAKDPGPDENSSALEIGTVDFEQHPSAENLSNVETTNAEHSEFPADIISDGSGDRGKMELLDSLSNPVIDPHENQDREGNKELISSTDSEAREVPNNGNEKMSSDLQVVEDVKVSSVEIEDYERRRDNPVASTEPRHSEGDEGSVNAVEDHEHQEESPDNKVDASHAQAPEGLAGNEAKEEAEGEIYQLQNKEAGEDVRERTENKSNVQERESQDNLEPPNKEADKANLESGEGIDKI |
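
The Location field above uses the GenBank-style `complement(join(start..end,...))` notation, where each `start..end` pair is one coding segment with inclusive 1-based coordinates. As a rough sketch, the segments can be parsed and summed to check the coding length against the Length field. The function names below are hypothetical and not part of any library. The sketch assumes every segment has the form `start..end` (no single-base or partial `<`/`>` positions) and that the CDS includes the terminal stop codon.

```python
import re


def parse_location(location: str):
    """Parse a GenBank-style location into (strand, [(start, end), ...]).

    Assumption: every segment has the form start..end. Single-base
    positions and partial markers (< or >) are not handled here.
    """
    strand = "-" if location.strip().startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def cds_length(segments):
    """Total coding length in nucleotides (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)


def expected_protein_length(cds_nt: int) -> int:
    """Residue count implied by a CDS that includes its stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_nt // 3 - 1


# Toy location for illustration only (not taken from this record).
strand, segments = parse_location("complement(join(10..15,20..25))")
print(strand, segments, cds_length(segments))
```

Under the same stop-codon assumption, a 2301 aa protein corresponds to a 6906 nt CDS, so passing the full Location string to `parse_location` and `cds_length` gives a quick consistency check for this record.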